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Serial No. 10/059,421 
Docket No. YOR920010126US2 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
In re Application of 

Shibuya, et al. 

SerialNo.: 10/059,421 Group Art Unit: 2168 

Filed: January 3 1 , 2002 Examiner: Ly, Cheyne D. 

For: SYSTEM AND METHOD FOR IDENTIFYING GENES 

Honorable Commissioner of Patents 
Alexandria, VA 223 1 3-1450 

STATEMENT OF SUBSTANCE OF INTERVIEW AND 

SUPPLEMENTAL RESPONSE 

Sir: 

Comes now Applicant and for his Statement of Substance of Interview and Supplemental 
Response, states as follows: 

REMARKS 

Applicant gratefully acknowledges the comments of Examiner C Dune Ly 3 and 
Supervisory Examiner Tim Vo which were provided to Attorney Phillip Miller, Inventor Isidore 
Rigoutsos, at the Personal Interview that was conducted in the above-identified Application on 
June 1 4, 2006. At the interview, Applicant explained the Rigoutsos '99 reference on which the 
Examiner relies, and presented the arguments detailing how Rigoutsos '99 does not teach or 
suggest the claimed invention (e.g., claim 1). 

Further, in supplement to the Amendment filed herein on April 4, 2006, Applicant states 
the following: 
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2 



1- 



Rigoutsos 



v 99 does not teach or suggest a "system for identifying genes 



if 



Applicant noted that claim 1 recites a "system for identifying genes". However, 
Rigoutsos '99 has nothing to do with "identifying genes". Indeed, this is clear from the title of 
Rigoutsos '99 which is "Building Dictionaries Of ID and 3D Motifs By Mining The Unaligned 
ID Sequences of... Genomes". That is, Rigoutsos '99 deals with building dictionaries of motifs 
(i.e., from amino acid sequences), not identifying genes. 

In fact, Rigoutsos T 99 discloses a process that maybe used to create a pattern database 
such as the pattern database used in the claimed invention. Indeed, Rigoutsos '99 is summarized 
on page 224, col. 1 which states **[wje report on the properties of the entries of these two 
dictionaries, the extent to which the ID seqlets have instances in the Protein Data Bank, and the 
ramifications from the induced coverage. We also present several entries of the generated 
dictionaries and discuss uses". As is clear from this summary, Rigoutsos "99 has nothing to do 
with 1) inputting a genomic DNA sequence, 2) translating an ORF of the DNA sequence into an 
amino acid translation, 3) locating in the amino acid translation occurrences of patterns from the 
pattern database, or 4) determining whether the ORF includes a putative gene. 

In fact, attached hereto is a flowchart which summarizes the steps described by Rigoutsos 
'99 in building the ID and 3D dictionaries. This flowchart clearly shows that Rigoutsos f 99 has 
nothing to do with identifying genes and clearly does not teach or suggest the claimed invention. 

2. Rigoutsos '99 does not teach or suggest "inputting a genomic DNA sequence" 

The Examiner relies on two simple words in the Abstract to support his position that 
Rigoutsos '99 teaches a DNA sequence. Specifically, the Examiner alleges that the phrase 
"genomic input" in line 6 of the Abstract necessarily implies DNA. 

However, Applicant would point out that the term "genomic input" is modified by the 
language following it. That is, the complete term used in the Abstract is "genomic input at the 
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level of amino add positions", which makes clear that the input in Rigoutsos '99 is not a DNA 
sequence (indeed, the purpose of the Rigoutsos '99 paper is to build a dictionary of motifs from 
amino acid sequences not from DNA sequences). 

Moreover, even assuming (arguendo) that the Abstract may be construed as disclosing a 
DNA sequence, the remainder of Rigoutsos '99 makes it abundantly clear that only amino acid 
sequences are being input, not DNA sequences. Indeed, for example, on page 225, Rigoutsos 
'999 states that 

"[ijhe database that we used as the input to Teiresias comprised the ORFsfrom the 17 
complete and publicly available archaeal and bacterial genomes. In particular, the database 
contained: 4289 ORFsfrom Escherichia coti with 1358990 [amino acids] ... " 

Indeed, nowhere in Rigoutsos ? 99 is the term DNA even used. Instead, Rigoutsos '99 
only refers to amino acid sequences. Thus, it is clear that contrary to the Examiner's allegations 
Rigoutsos '99 does not teach or suggest inputting a DNA sequence. 

3. Rigoutsos '99 does not teach or suggest translating an ORF of the input DNA 
sequence into an amino acid translation 

The Examiner's position here is unclear. The Examiner first indicates on page 3 of the 
Office Action that this feature is disclosed at page 224, col. 1, lines 6-40. However, later on page 
3 the Examiner concedes that Rigoutsos '99 "does not explicitly describe the limitation of 
'translates an open reading frame™, but implies that this feature is inherent from the disclosure on 
page 225, column 2. 

However, Applicant would submit that nowhere does col. 1 on page 224 or col. 2 (the 
"Database" Section) on page 225 teach or suggest translating an ORF of an input DNA sequence 
into an amino acid translation. Indeed, col. 1 on page 224 merely states that 1) a database 
containing the ORFs for the complete genomes of 13 bacteria were compiled, 2) the database was 
processed to produce a dictionary of seqlets, and 3) the seqlets were aligned in three-dimensional 
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space. Further, as noted above, the "Database" Section on page 225 merely states that: 

"ftjhe database that we used as the input to Teiresias comprised the ORFsfrom the 1 7 
complete and publicly available archaeal and bacterial genomes. In particular, the database 
contained; 4289 ORFs from Escherichia coli with J 358990 [amino acids] ... " 

Nowhere in these passages does Rigoutsos '99 even mention translating an ORF of an 
input DNA sequence into an amino acid translation. In fact, the Examiner seems to imply (in 
paragraph 7 on page 3 of the Office Action) that Rigoutsos f 99 discloses amino acid sequences, 
and that somebody had to have translated these amino acid sequences from a DNA sequence at 
some point, then Rigoutsos f 99 discloses translating a DNA sequence into an amino acid 
translation. 

However, the Examiner is missing an important point. It is not any old DNA sequence 
that is being translated in the claimed invention. Instead, the claim recites translating an ORF 
of the input DNA sequence into an amino acid translation. That is, it is an ORF of the input 
(e.g., query) DNA sequence under examination that is being translated. 

That is, even assuming (arguendo) that the amino acid sequences in Rigoutsos f 99 were at 
some point in time translated from a DNA sequence, nowhere does Rigoutsos '99 teach or 
suggest a system which inputs a DNA sequences and translates an ORF of that input sequence 
into an amino acid sequence, 

4. Rigoutsos '99 does not teach or suggest locating in the amino acid translation 
occurrences of patterns from the pattern database to determine whether 
the ORF includes a putative gene 

Applicant would point out that this element of the claimed invention may be considered 
to have two parts: 

1) locating occurrences of patterns from the pattern database in the amino acid 
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translation; and 

2) determining (e.g., based on whether the patterns occur in the amino acid translation) 
whether the ORF includes a putative gene. 

The Examiner alleges in the Office Action that this element is taught at page 228, col. 2, 
lines 4-1 1 in Rigoutsos f 99. However, this passage merely reads (in pertinent part) 

"[o]ne of the seqlets that are discovered when we process the input database is .... and is 

present in the following ten ORFs: Of these, gi_3328856 and gi_3329230 are annotated as 

Fe-S oxidoreductases* " 

That is, this simple passage clearly does not teach or suggest 1) locating occurrences of 
patterns from the pattern database in the amino acid translation, or 2) determining whether the 
ORF includes a putative gene. 

When asked at the interview where in this passage these features are disclosed, he 
Examiner stated at the features were disclosed "at other places" in Rigoutsos '99. Applicant 
respectfully requested that, in a next office action, the Examiner more clearly indicate the 
specific disclosure in Rigoutsos '99 on which the Examiner was relying as disclosing these 
features of the claimed invention. 

First, this passage clearly does not teach or suggest locating occurrences of patterns from 
the pattern database in the amino acid translation. Indeed, nowhere does this passage discuss 
"locating" anything, let alone locating occurrences of "patterns" in an amino acid translation 
(from the input DNA sequence). 

Further, the Examiner seems to misunderstand the term "putative gene". Indeed, the 
Examiner seems to imply that this passage in Rigoutsos '99 teaches identifying a putative gene 
from the mere disclosure that two amino acid sequences (gi_3328856 and gi_3329230) are 
annotated as oxidoreductases. This is clearly unreasonable, 

Indeed, an oxidoreductase is simply an enzyme that catalyzes the transfer of electrons 
from one molecule to another. An enzyme may be but is not necessarily a protein. 
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The Commissioner is hereby authorized to charge any deficiency in fees or to credit any 
overpayment in fees to Assignee's Deposit Account No. 50-0510. 



Date: 



McGinn IP Law Group, PLLC 

8321 Old Courthouse Road, Suite 200 
Vienna, VA 22182-3817 
(703) 761-4100 
Customer No. 21254 



r 



Respectfully Submitted, 





Phillip E. Miller, Esq. 
Registration No, 46,060 



CERTIFICATE OF FACSIMILE TRANSMISSION 

I hereby certify that the foregoing was filed by facsimile with the United States Patent and 
Trademark Office, Examiner C. Dune Ly, Group Ait Unit # 2168 at fax number 571-272-8300 
this day of >Jt 2006^ — . 

<2 

Phillip E. Miller 
Reg. No. 46,060 
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Input amino acid sequences from database containing ORFs for 
complete genomes of various bacteria, into the Teresias Algorithm 
(p. 224, col. 1, lines 6-7; col. 2, lines 32-35; p. 225, col. 2, lines 32-34) 



43 



Use Teresias Algorithm on the input amino acid sequences to 
discover seqlets (i.e., re-usable patterns) ini the amino acid sequences 
(p. 224, col 1, lines 23-24; p. 225, col. 1, lines 11-18) 



43 



Generate a ID dictionary of the seqlets discovered 
by the Teiresias Algorithm (p. 224, goL 1, lines 24-25) 



Identify all instances of the seqlets in a version of Protein Data Bank (PDB) 
by treating each of the sequences from the PDB as a query sequence and 
determining which of the seqlets from the ID dictionary are present in the query 
sequence (p. 224, col. 1, lines 26-31; p. 226, col. 1, lines 10-12) 



43 



For recurring seqlets (i.e., seqlets that appear ai least twice in the sequences 
from the PDB), extract corresponding structure fragments from the PDB and align the 
fragments in 3D space (p. 224, col. 1, lines 26-31; p. 226, col. 1, lines 12-16) 



I3_ 



Enter recurring seqlets having acceptable error in a 3D dictionary of 
seqlets together with the alignment of the respective structure 
fragments (p, 224, col 1, lines 31-35; p. 226, col. 1, lines 47-50) 
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